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- The MAILING DATE of this communication appears on the cover sheet with the correspondence address- 
All claims being allowable, PROSECUTION ON THE MERITS IS (OR REMAINS) CLOSED in this application. If not included 
herewith (or previously mailed), a Notice of Allowance (PTOL-85) or other appropriate communication will be mailed in due course. THIS 
NOTICE OF ALLOWABILITY IS NOT A GRANT OF PATENT RIGHTS. This application is subject to withdrawal from issue at the initiative 
of the Office or upon petition by the applicant. See 37 CFR 1 .31 3 and MPEP 1 308. 

1 . ^ This communication is responsive to 5/4/2006 . 

2. E3 The allowed claim(s) is/are 28-40. 55-61 (Renumbered claims 1-20) . 

3. □ Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 1 19(a)-(d) or (f). 

a) □ All b) □ Some* c) □ None of the: 

1 . □ Certified copies of the priority documents have been received. 

2. □ Certified copies of the priority documents have been received in Application No. . 

3. □ Copies of the certified copies of the priority documents have been received in this national stage application from the 

International Bureau (PCT Rule 17.2(a)). 
* Certified copies not received: . 

Applicant has THREE MONTHS FROM THE "MAILING DATE" of this communication to file a reply complying with the requirements 
noted below. Failure to timely comply will result in ABANDONMENT of this application. 
THIS THREE-MONTH PERIOD IS NOT EXTENDABLE. 

4. □ A SUBSTITUTE OATH OR DECLARATION must be submitted. Note the attached EXAMINER'S AMENDMENT or NOTICE OF 

INFORMAL PATENT APPLICATION (PTO-152) which gives reason(s) why the oath or declaration is deficient. 

5. □ CORRECTED DRAWINGS ( as "replacement sheets") must be submitted. 

(a) □ including changes required by the Notice of Draftsperson's Patent Drawing Review ( PTO-948) attached 

1) □ hereto or 2) □ to Paper NoVMail Date . 

(b) □ including changes required by the attached Examiner's Amendment / Comment or in the Office action of 

Paper NoVMail Date . 

Identifying indicia such as the application number (see 37 CFR 1.84(c)) should be written on the drawings in the front (not the back) of 
each sheet. Replacement sheet(s) should be labeled as such in the header according to 37 CFR 1.121(d). 
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DETAILED ACTION 
EXAMINER'S AMENDMENT 

1. An examiner's amendment to the record appears below. Should the changes and/or 
additions be unacceptable to applicant, an amendment may be filed as provided by 37 CFR 
1.312. To ensure consideration of such an amendment, it MUST be submitted no later than the 
payment of the issue fee. 

Authorization for this examiner's amendment was given in a telephone interview with 
attorney Mohammad S. Rahman (Reg. No. 43,029) the undersigned for Applicant(s) on 3/27/07. 

The application has been amended as follows: 
In the Specification; 

Please replace pages 18, 26, and 27 in the original specification by the new attached pages 

18, 26, and 27 in order to be consistent with equation number 6 on page 18, the last two equation 

on page 26, and equation number 8 on page 27. See the attached papers. 

Conclusion 

2. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Jacques Veillard whose telephone number is (571) 272-4086. 
The examiner can normally be reached on Mon. to Fit from 9 AM to 4:30 PM, alt. Fri. off. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Jeffrey Gaffin can be reached on (571) 272- 4146. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 



Application/Control Number: 10/660,242 Page 3 

Art Unit: 2165 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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Intuitively, the objective function measures the combined coherence of all the k clusters. 
The Process. The invention's objective is to find k disjoint clusters 

1 ' 2 ' ' 3 such that the following is maximized 



Even when only one of the parameters ow, a/, or ab is nonzero, finding the optimal 
solution to the above maximization problem is known to be NP-complete. Discussed 
below is an efficient and effective approximation process: the toric k-means that may be 
thought of as a gradient ascent method. 

Step 1. Start with an arbitrary partitioning of the document vectors, namely, 



partitioning. Set the index of iteration t = 0. The choice of the initial partitioning is quite 

crucial to finding a "good" local minima; for recent work on this area, see (Bradley, P., 

and Fayyad, U. Refining initial points for k-means clustering In ICML (1998), pp. 91-99, 

incorporated herein by reference). 

Step 2. For each document vector triplet xi 9 1 [ i [ n the invention finds the 

concept triplet that is closest to xl Now, for 1 [j [ k, compute the new 

1 ^ * ^ 
partitioning X '' - 1 - n 




(6) 




denote the concept triplets associated with the given 
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Throughout this section, the invention fixes the number of clusters k >2. As 
before, let ota , (Xf , and abbe nonnegative numbers that sum to 1. Geometrically, these 
parameters lie on a planar triangular region, say, Ao, that is shown in Figure 5. For 

(•if 

brevity, the invention writes a = (ad , (Xf , a*,). Let n(a) = *J s} , denote the 
5 partitioning obtained by running the toric £-means process with the parameter values aa , 
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//(a) : a e Aq 



The invention selects 



af , and at>. From the set of all possible clusterings 
a partitioning that yields the best cluster annotations. Towards this goal, the invention 
introduces a figure-of-merit for evaluating and comparing various clusterings. 

Fixing a clustering 11(a). For the given clustering, the summary, which is a 
10 descriptive characteristic, for each of the clusters will be good if each cluster is as 
coherent as possible in the word feature space, that is, if the following is maximized: 

k 

J=lX€7T i 

where jc = (Z), F, B). Furthermore, the keywords, which are a discriminative 

characteristics, will be good if the following is minimized: 

k k 
A ,(*)= A ,(//(*)) = II I D?D* 
a a j= lXenJ= j J 1 

where x = (Z), F, B). Intuitively, r d (a) and A d (a) capture the average within cluster 
coherence and average between cluster coherence, respectively, of the 
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clustering n(a) in the word feature space. The summary and the keywords both will be 
good if the following ratio is maximized: 



5 where m denotes the number of document triplets in Q that have a non-zero word feature 



separated in the word feature space. 

The quantities Tf(a), A f(a), Y b (a), A b (a), Qf(a), and Qb{a) are defined in a 
similar fashion. The quantity gf(a)should be maximized to obtain good quality review 
10 and references, and the Quantity Qb(a) should be maximized to obtain good quality 
breakthrough and citations. 

In Figure 5 the triangular region A 0 is formed by the intersection of the plane ad + 
afr a b = 1 with the nonnegative orthant of R 3 . The left-vertex, the right-vertex, and the 
top-vertex of the triangle correspond to the points (1, 0, 0), (0, 1, 0), and (0, 0, 1), 
15 respectively. 

The following shows how the invention selects the optimal parameter tuple a\ and 
the corresponding clustering II( a*). 

Step 1. Theoretically, the invention would like to run the tortic k-means process 
for every parameter triplet in: 




if A d (a)> 0, 



if Ad(a)=0, 



(8) 



vector; see, for example, Figure 3. In the case that Ad(a) = 0, the clusters are perfectly 
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(9) 
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